An HMM-based system for automatic segmentation and alignment of speech

نویسنده

  • Kåre Sjölander
چکیده

A system for automatic time-aligned phone transcription of spoken Swedish has been developed. Using a speech recording and an orthographic transcription of the words spoken in the recording the system is able to generate a phone-level segmentation without manual intervention. The system uses a technique based on Hidden Markov Models to position 85.5% of all boundary positions within 20 ms of manually segmented reference boundaries on a set of test recordings.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Segmentation Combining and Spectral Boundary

Currently, AT&T Labs’ Natural Voices multilingual TTS system produces high-quality synthetic speech with a largescale speech corpus [1]. In the development of such systems, automatic segmentation constitutes a major component technology. The prevalent approach for automatic segmentation in speech synthesis is Hidden Markov Model (HMM) based. Even though an HMM-based approach is the most automat...

متن کامل

Phonetic alignment: speech synthesis based vs. hybrid HMM/ANN

In this paper we compare two different methods for phonetically labeling a speech database. The first approach is based on the alignment of the speech signal on a high quality synthetic speech pattern, and the second one uses a hybrid HMM/ANN system. Both systems have been evaluated on French read utterances from a speaker never seen in the training stage of the HMM/ANN system and manually segm...

متن کامل

HMM-based automatic visual speech segmentation using facial data

We describe automatic visual speech segmentation using facial data captured by a stereo-vision technique. The segmentation is performed using an HMM-based forced alignment mechanism widely used in automatic speech recognition. The idea is based on the assumption that using visual speech data alone for the training might capture the uniqueness in the facial component of speech articulation, asyn...

متن کامل

Automatic speech segmentation and verification for concatenative synthesis

This paper presents an automatic speech segmentation method based on HMM alignment and a categorized multiple-expert fine adjustment. The accuracy of syllable boundaries is significantly improved (72.8% and 51.9% for starting and ending boundaries of syllables, respectively) after the fine adjustment. Moreover, a novel phonetic verification method for checking inconsistency between text script ...

متن کامل

Automatic segmentation of speech based on hidden Markov models and acoustic features

An accurate database segmented and labeled at phonetic, subword or word level is very important for speech research. However, manual segmentation and labeling is a time consuming and error prone task. This paper describes an automatic procedure for the segmentation of speech in a set of acoustic sub-words units: given either the linguistic or the phonetic content of a speech utterance, the syst...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003